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Abstract — The construction of asymmetric error correcting 
codes is a topic that was studied extensively, however, the existing 
approach for code construction assumes that every codeword 
should tolerate t asymmetric errors. Our main observation is that 
in contrast to symmetric errors, asymmetric errors are content 
dependent. For example, in Z-channels, the all-1 codeword is 
prone to have more errors than the all-0 codeword. This motivates 
us to develop nonuniform codes whose codewords can tolerate 
different numbers of asymmetric errors depending on their 
Hamming weights. The idea in a nonuniform codes' construction 
is to augment the redundancy in a content-dependent way and 
guarantee the worst case reliability while maximizing the code 
size. In this paper, we first study nonuniform codes for Z- 
channels, namely, they only suffer one type of errors, say 1 — >■ 0. 
Specifically, we derive their upper bounds, analyze their asymp- 
totic performances, and introduce two general constructions. 
Then we extend the concept and results of nonuniform codes to 
general binary asymmetric channels, where the error probability 
for each bit from to 1 is smaller than that from 1 to 0. 

Index Terms — Nonuniform Codes, Asymmetric Errors, Coding 
for Data Storage, Bounds and Constructions. 



I. Introduction 

ASYMMETRIC errors exist in many storage devices H. 
In optical disks, read only memories and quantum mem- 
ories, the error probability from 1 to is significantly higher 
than the error probability from to 1, which is modeled by 
Z-channels where the transmitted sequences only suffer one 
type of errors, say 1 — > 0. In some other devices, like flash 
memories and phase change memories, although the error 
probability from to 1 is still smaller than that from 1 to 
0, it is not ignorable. That means both types of errors, say 
1—^0 and — > 1 are possible, modeled by binary asymmetric 
channels. In contrast to symmetric errors, where the error 
probability of a codeword is context independent (since the 
error probability for Is and 0s is identical), asymmetric errors 
are context dependent. For example, the all-1 codeword is 
prone to have more errors than the all-0 codeword in both 
Z-channels and binary asymmetric channels. 

The construction of asymmetric error correcting codes is a 
topic that was studied extensively. In lfl4ll . Kl0ve summarized 
and presented several such codes. In addition, a large amount 
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of efforts are contributed to the design of systematic codes 
ID, Q, constructing single or multiple error-correcting codes 
CD, OH, QD, increasing the lower bounds Q-QD, El and 
applying LDPC codes in the context of asymmetric channels 
||2TI . However, the existing approach for code construction is 
similar to the approach taken in the construction of symmetric 
error-correcting codes, namely, it assumes that every codeword 
could tolerate t asymmetric errors (or generally t± 1 —} 
errors and t?. —> 1 errors). As a result, different codewords 
might have different reliability. To see this, let's consider 
errors to be i.i.d., where every bit that is a 1 can change 
to a by an asymmetric error with crossover probability 
p > and each bit that is a keeps unchanged. For a 
codeword x = (xi, x%, . . . , x n ) £ {0,1}", let w(x) = 
\{i : 1 < i < n, Xi = 1}\ denote the Hamming weight of x. 
Then the probability for x to have at most t asymmetric errors 
is 

«w=t( ,fl(x V 1 -^ w ''' 

i=0 x ' 

Since x can correct t errors, P*(x) is the probability of 
correctly decoding x (assuming codewords with more than 
t errors are uncorrectable). It can be readily observed that 
the reliability of codewords decreases when their Hamming 
weights increase, for example, see Fig. Q] 

While asymmetric errors are content dependent, in most 
applications of data storage the reliability of each codeword 
should be content independent. Namely, unaware of data 
importance, no matter what content is stored, it should be 
retrieved with very high probability. The reason is that once 
a block cannot be correctly decoded, the content of the 
block, which might be very important, will be lost forever. 
So we are interested in the worst-case performance rather 
than the average performance that is commonly considered in 
telecommunication, and we want to construct error-correcting 
codes that can guarantee the reliability of every codeword. In 
this case, it is not desired to let all the codewords tolerate 
the same number of asymmetric errors, since the codeword 
with the highest Hamming weight will become a 'bottleneck' 
and limit the code rate. We call the existing codes uniform 
codes while we focus on the notion of nonuniform codes, 
namely, codes whose codewords can tolerate different numbers 
of asymmetric errors depending on their Hamming weights. 
The goal of introducing nonuniform codes is to maximize the 
code size while guaranteeing the reliability of each codeword 
for combating asymmetric errors. 

In a nonuniform code, given a codeword x G {0, 1}™ of 
weight w, we let t\_{w) denote the number of 1 —> errors that 
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Fig. 1. The relation between Pt(x) and ui(x) when p = 0.1 and t = 2. 



x has to tolerate, and we let tf(w) denote the number of —> 1 
errors that x has to tolerate. Both t± and if are step functions 
on {0, 1, n} that can be predetermined by the channel, the 
types of errors and the required reliability. In this paper, we 
consider t± a nondecreasing function and i-j- a nonincreasing 
function of codeword weight. As a result, we call such a code 
as a nonuniform code correcting [ij.,^] errors. In particular, 
for Z-channels where t^(w) = for all < w < n, we 
call it a nonuniform code correcting t± asymmetric errors. 
Surprisingly, while nonuniform codes seem to be a natural 
idea (especially in data storage applications), they were not 
studied in the literature. 

Example 1. In Z-channels, let p be the crossover probability 
of each bit from 1 to and let q e < 1 be maximal tolerated 
error probability for each codeword. If we consider the errors 
to be Ltd., then we can get 



i{s 6 N\ 



i=0 



p \l- p )^>l- qe ) (1) 



for < w < n. In this case, every erroneous codeword can 
be corrected with probability at least 1 — q e . □ 

The following notations will be used throughout of this 
paper: 

q e the maximal error probability for each codeword 

p, pi the error probability of each bit from 1 to 
Pf the error probability of each bit from to 1 

t± a nondecreasing function that indicates 

the number of 1 — > errors to tolerate 
tf a nonincreasing function that indicates 

the number of — > 1 errors to tolerate 



In this paper, we introduce the concept of nonuniform 
codes and study their basic properties, upper bounds on the 
rate, asymptotic performance, and code constructions. We first 
focus on Z-channels and study nonuniform codes correcting 
fi asymmetric errors. The paper is organized as follows: In 
Section [TT] we provide some basic properties of nonuniform 
codes. In Section HTT1 we give an almost explicit upper bound 



for the size of nonuniform codes. Section [TVl studies and com- 
pares the asymptotic performances of nonuniform codes and 
uniform codes. Two general constructions, based on multiple 
layers or bit flips, are proposed in Section [V] and Section [VI] 
Finally, we extend our discussions and results from Z-channels 
to general binary asymmetric channels in Section [VTll where 
we study nonuniform codes correcting [tj., if] errors, namely, 
1 — 5- errors and i-j- — > 1 errors. Concluding remarks are 
presented in Section IVIIII 

II. Basic Properties of Nonuniform Codes for 
Z-Channels 

Storage devices such as optical disks, read-only memories 
and quantum atomic memories can be modeled by Z-channels, 
in which the information can suffer a single type of error, 
namely 1 — > 0. In this section, we study some properties of 
nonuniform codes for Z-channels, namely, codes that only cor- 
rect ti asymmetric errors. Typically, t^(w) is a nondecreasing 
function in w, the weight of the codeword. We prove it in the 
following lemma for the case of i.i.d. errors. 

Lemma 1. Assume the errors in a Z-channel are i.i.d., then 
given any < p, q e < 1, the function t\, defined in (17]) satisfies 
t^w + l) -t±(w) £ {0,1} for all0<w<n-l. 

Proof: Let us define 



Then 



P(k, w, p) = (w — k) 



which leads us to 



l-p 



■j-fc-i 



(1 - t) k dt, 



P(k,w,p) — P(k, w + l,p) 
k + 1 

-[P(k + l,w + l,p)-P(k,w 



+ l,p)]. (2) 



w + 1 

First, let us prove that t±(w + 1) > t±(w). Since 

P(k + l,w + l,p)-P{k,w + l,p) >0, 

we have P(k, w,p) > P(k, w + l,p). 

We know that P(t±(w + 1), w + l,p) > 1 — q e , so 

P(t±(w + l),w,p) >l-q e . 

According to definition of t±(w), we can conclude that t±(w + 

i) > hH- 

Second, let us prove that t^(w + 1) — < 1. Based on 

equation ©, we have 

P{k,w,p) - P(k + l,w+ l,p) 
w — k , 



1 



[P(k,w + l,p) -P(k + l,w + l,p)j. 



So P(k, w,p) < P(k + l,w + l,p). 

We know that P(ti(w), w,p) > 1 — q e , therefore 

P(t l (w) + l,w + l,p) >l-q e . 



3 



According to the definition of t^iy + l), we have tj_(w + l) < 

h(w) + 1. 

This completes the proof. ■ 

Given two binary vectors x = {x\, . . . ,x n ) and y = 
... , y n ), we say x < y if and only if x l < y l for all 
1 < % < n. Let £>(x) be the (asymmetric) 'ball' centered at x, 
namely, it consists of all the vectors obtained by changing at 
most t±(w(x)) Is in x into Os, i.e., 

23(x) = {v e {0, l}"|v < x and 2V(x, v) < ^(u)(x))}, 

where iu(x) is the weight of x and 

N(x,y)±\{i:x i = l,y i = 0}\. 

We have the following properties of nonuniform codes as 
the generalizations of those for uniform codes studied in [ 14 ]. 

Lemma 2. Code C is a nonuniform code correcting fj. 
asymmetric errors if and only if £>(x) p|£>(y) = <fr for all 
x, y £ C with x^y. 

Proof: According to the definition of nonuniform codes, 
all the vectors in £>(x) can be decoded as x, and all the vectors 
in B(y) can be decoded as y. Hence, £>(x)p|£>(y) = <f> for 
all x, y £ C. ■ 

Lemma 3. There always exists a nonuniform code of the 
maximum size that corrects t\, asymmetric errors and contains 
the all-zero codeword. 

Proof: Let C be a nonuniform code correcting asym- 
metric errors, and assume that 00. ..00 ^ C. If there exists a 
codeword x £ C such that 00. ..00 £ £>(x), then we can get 
a new nonuniform code C' of the same size by replacing x 
with 00... 00 in C. If there does not exist a codeword x £ C 
such that 00. ..00 £ $(x), then we can get a larger nonuniform 
code C by adding 00. ..00 to C. ■ 

Given a nonuniform code C, let A r denote the number of 
codewords with Hamming weight r in C, i.e., 

A r = |{x£ C\w(x) =r}\. 

Given a nondecreasing function t±, let R r denote a set of 
weights that can reach weight r with at most 4j, asymmetric 
errors, namely, 

Rr = {0 < s < n\s - ti(s) < r < s}. 

Lemma 4. Let C be a nonuniform code correcting t± asym- 
metric errors. For < r < n, we have 



E 

jeRr 



>><-{: 



(3) 



Proof: Let V r = {x £ {0, l}"|w(x) = r} be the set 
consisting of all the vectors of length n and weight r. If x £ C 
with w(x) = j £ R r , according to the properties of t^, B(x) 
contains ({) vectors of weight r, namely 



|F r f|B(x)| = 



According to Lemma |2] we know that Uxgc(^ D ^( x )) ^ s a 
disjoint union, in which the number of vectors is 



E 



Since Uxgc(^" D ^( x )) — an< ^ tnere are at most 
vectors in V r , the lemma follows. 



V r J 



III. Upper Bounds 

Let B a (n,t) denote the maximum size of a uniform code 
correcting t asymmetric errors, and let Bp(n,t±) denote the 
maximum size of a nonuniform code correcting t± asymmetric 
errors, where t is a constant and t± is a nondecreasing function 
of codeword weight. In this section, we first present some 
existing results on the upper bounds of B a (n,t) for uniform 
codes. Then we derive an almost explicit upper bound of 
B/3( n ,tl) for nonuniform codes. 

A. Upper Bounds for Uniform Codes 

An explicit upper bound to B a (n,t) was given by Var- 
shamov |[T8l . In 0141 . Borden showed that B a (n,t) is upper 
bounded by 

min{A(n + t,2t + 1), (t + l)A{n, It + 1)}, 

where A(n,d) is the maximal number of vectors in {0, 1}™ 
with Hamming distance at least d. Goldbaum ifTD pointed out 
that the upper bounds can be obtained using integer program- 
ming. By adding more constrains to the integer programming, 
the upper bounds were later improved by Delsarte and Piret 
(6) and Weber et al. Il22l ll23l . Kl0ve generalized the bounds 
of Delsarte and Piret, and gave an almost explicit upper 
bound which is very easy to compute by relaxing some of 
the constrains iTOI . in the following way. 



Theorem 5. (131 For n > 2t > 2, let yo,yi, 
by 

1) yo = 1, 

2) y r = 0, VI < r < t, 

1 ,1 n 

.r 



.,y n be defined 



3) y t +r = 



(T) 

4) y n - r = y r , V0 < r < 



\ - fr + j 

3=0 

n 



J 



,Vl<r<--t, 



Then B a (n,t) < M a (n,t) = Y%=oVr- 

This method obtains a good upper bound to B a (n,t) 
(although it is not the best known one). Since it is easy 
to compute, when n and t are large, it is every useful for 
analyzing the sizes of uniform codes. 



B. Upper Bounds for Nonuniform Codes 

We now derive an almost explicit upper bound for the size 
of nonuniform codes correcting fj_ asymmetric errors, followed 
the idea of Kl0ve lfT3l for uniform codes. According to the 
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weight 



Fig. 2. This diagram demonstrates the relative values of r, g, k,m. 



lemmas in the previous section, we can get an upper bound 

of Bp(n,ti), denoted by Mp(n,t±), such that 

n 

Mp(n,t±) = maxy^ z r , 

i=0 

where the maximum is taken over the following constraints: 

1) z r are nonnegative real numbers, 

2) z = 1, 

3 ) £ (j)^<(;),V0<r<n. 

jERr v ' v 7 

Here, condition 2) is given by Lemma [3] and condition 3) is 
given by Lemma |4] Our goal is to find an almost explicit way 
to calculate Mp(n,t±). 

Lemma 6. Assume 53"=o z r ' s maximized over z , z\, z n 
in the problem above. If r = s — t±(s) for some integer s with 
< s, r < n, then 



Z r = £ 



Proof: Suppose that Z r < (™J for some r that satisfies 
the above condition. Let g = max Rr and fc = minjwlzu, > 
0, w > g}, as indicated in Figure [2] where a triangular denote 
the ball centered at the top vertex. Furthermore, we let m — 
max{w|fc — > w}. Note that in this case r = g — t±(g) 
and m = k — t^(k) — 1. 

We first prove that for all r < w < m, Z w < ("J . In order 
to prove this, we let s = w — r, then we get 



g 



= E^ 

j=w 

g-r 

= 



•r+j 



r+j 
r + s 



It is easy to obtain that 

+ j\ _ fr + j\ (1) 



So 



Z w < 



< 



< 



(g-r\ g-r 



(r-\-s^ 



( 9 7) 



E 

n 
r 



Z r -\-j 



(g - - r ~ !)---(g - r - s + 1) / n 
(n — r){n — r — l)...(n — r — s + 1) \r + s 
n 



Now, we construct a new group of real numbers 
Zq, z*, z* such that 



1) z* g = z g + A, 

2) z* k = z k - 5, 

3) z* = z r for r ^ h, r ^ k. 

with 

A = min({ VV - |t- < w < m }{j{^L Zk \m < w < 



1 



min{-pj|TO < u> < g} 



A. 



For such A, S, it is not hard to prove that Z* = (™) for 
< r < n. On the other hand, 

n n n 

53 < = E ^ + A - 6 > E z ^ 



r=0 r=0 



r=0 



which contradicts our assumption that J2r=o Zr 15 maximized 
over the constrains. So the lemma is true. ■ 

Lemma 7. Assume J2r=o Zr !S maximized over zq, z\, z n 
in the problem above. Ifr = s — t±(s) for some integer s with 
< s, r < n, then 



E 



J \ in 

Zi = 



where 



h = min{s G N\s — t^(s) = r}. 



Sketch of Proof: Let g = max{s G N\s — t^(s) = r}. If 
g = h, then the lemma is true. So we only need to prove it for 
the case that g > k. Similar to lemma|6] we assume Z r < (™), 
to get the contradiction, we can construct a new group of real 



numbers Zq , z\ , z* such that 



1) z* h = z h + A, 

2) z* = for h < w < g, 

3 ) z w = z w if w £ [h,g]. 



with 



r 

r + s 



A = min{ 



• rJ2j=h+l (w) : 



\r < w < h}. 
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For this Zq, z*, z*, they satisfy all the constrains and 



At the same time, it can be proved that 



n 



r=0 r=0 



which contradicts with our assumption that J2r=o z r i s max- 
imized over the constrains. This completes the proof. ■ 

Now let yo , yt, ■■ ., y n be a group of optimal solutions 
to zo,zi,...,z n that maximize J2r=o Zr - Then y a ,yi,...,y n 
satisfy the condition in Lemma [7] We see that yo = 1. Then 
based on Lemma|7] we can get yi, y n uniquely by iteration. 
Hence, we have the following theorem for calculating the 
upper bound Mp(n, ij,). 

Theorem 8. Let y , yi, y n be defined by 

i) yo = l, 



2) y 



l 



VI < r < n. 



n 



men B p (n,ti) < Mp{n,t x ) = X)JL 3/ r . 

This theorem provides an almost explicit expression for the 
upper bound Mp(n, ti), which is much easier to calculate 
than the equivalent expression defined at the beginning of 
this subsection. Note that in the theorem, we do not have a 
constrain like the one (constraint 4) in Theorem[5] It is because 
that the optimal nonuniform codes do not have symmetric 
weight distributions due to the fact that ti(w) monotonically 
increases with w. 



C. Comparison of Upper Bounds 

Here we focus on i.i.d. errors, i.e., given the crossover 
probability p from to 1 and the maximal tolerated error 
probability q e , the function t± is defined in equation ([TJl. In 
this case, we can write the maximum size of a uniform code 
as B a (n,ti(n)) — B a (n,p, q e ), and write the maximum size 
of a nonuniform code as Bp(n : t^(n)) = B/3(n,p,q e ). 

Now we let r] a (n,p,q e ) denote the maximal code rate 
defined by 



r) a {n,p, q e ) 



log B a (n,p,q e ) 



Similar, we let rjp(n,p,q e ) denote the maximal code rate 
defined by 

logBp(n,p,q e ) 

r]p{n,p,q e ) = . 

n 

By the definition of uniform and nonuniform codes, it is simple 
to see that r)p(n,p,q e ) > rj a (n,p,q e ). 

Figure [3] depicts the upper bounds of r) a (n,p,q e ) and 
T)8(n,p,q e ) for different values of p when n = 255 and 
q e = 10~ 4 . The upper bound of T) a (n,p, q e ) is obtained based 



0.9 
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Fig. 3. Upper bounds of the rates for uniform/nonuniform codes when 
n = 255, q e = 10" 4 . 



on the almost explicit upper bound given by Kl0ve, and the 
upper bound of r)p(n,p,q e ) is obtained based on the almost 
explicit method proposed in this section. It demonstrates that 
given the same parameters, the upper bound for nonuniform 
codes is substantially greater than that for uniform codes. 

IV. Asymptotic Performance 

In this section, we study and compare the asymptotic 
rates of uniform codes and nonuniform codes. Note that the 
performance of nonuniform codes strongly depends on the 
selection of the function t^. Here, we focus on i.i.d. errors, 
so given < p, q e < 1, we study the asymptotic behavior of 
r] a (n,p,q e ) and r)p(n,p,q e ) as n — > oo. By the definition of 
nonuniform and uniform codes, the 'balls' containing up to 
ij,(x) (or ti(n)) errors that are centered at codewords x need 
to be disjoint. 

Before giving the asymptotic rates, we first present the 
following known result: For any S > 0, when n is large 
enough, we have 

2 n(ff(±)-5) < < 2"t H '(£)+*) ) 

where H (p) is the entropy function with 

H(p) = plog — h (1 — p) log for < p < 1, 

p - l-p 



and 



H(p) = for p > 1 or p < 0. 



Lemma 9. Let A(n, d, w) be the maximum size of a constant- 
weight binary code of codeword length n, whose Hamming 
weight is w and minimum distance is d. Let R(n, t, w) be 
the maximum size of a binary code with Hamming weight w 
and codeword length n where every codeword can correct t 
asymmetric errors. Then 

R(n, t, w) = A(n, 2(t + 1), to). 
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Proof: Let C be a code of length n, constant weight w 
and size R(n, t, w) that corrects t asymmetric errors. For all 
x G {0, 1}™, let's define St(x) be the set consisting of all the 
vectors obtained by changing at most t Is in x into 0s, i.e., 

S t (x) = {v e {0,l} n |v < x and N(x,v) < t}. 

Then Vx. y e C, we know that 5 t (x) f| 5 t (y) = 0. 

Let u = (iti, . . . ,u n ) be a vector such that ui = 
min{xi,yi} for 1 < % < n. Then N(x, u) = A^(y, u) and 
u ^ 5 t (x) f)'S'i(y). W.l.o.g, suppose that u £ Sf(x). Then 
N(x, u) > i, and the Hamming distance between x and y is 

d(x, y) = N(x, u) + N(y, u) > 2(t + 1). 

So the minimum distance of C is at least 2(t + 1). As a result, 
A(n, 2(f + 1), w) > R(n, t, to). 

On the other hand, if a constant-weight code has minimum 
distance at least 2(i + 1), it can correct t asymmetric errors. 
As a result, R(n, t, to) > A(n, 2(t + l),w). ■ 



A. Bounds of lim, woo r\ a (n,p, q e ) 

Let us first give the lower bound of linin^oo r/ a (n,p, q e ) 
and then provide the upper bound. 

Theorem 10 (Lower bound). Given 0<q e <l,if0<p<j, 
we have 

lim T) a (n,p,q e ) > 1 - H(2p). 

n— >oo 

Proof: We consider uniform codes that correct t asym- 
metric errors, where 



* = min{,s| J2 ("W - > 1 - Qe}- 



According to Hoeffding's inequality, for any 5 > 0, as n 
becomes large enough, we have (p — S)n < t < (p + S)n. If 
we write t — jn, then p — S < 7 < p + 5 for n large enough. 

Since each codeword tolerates t asymmetric errors, we have 

B a (n,p, q e ) = B a (n, t) > R(n, t, to) = A(n, 2(t + 1), to), 

for every to with < w < n. The Gilbert Bound gives that 
(see Graham and Sloane Ifl2l ) 

A(n,2(t + l),io) > — f — r- 



Hence 



(;;) 



B a (n,p,q e ) > max — , 



> max 



w=0 v ( w ) ( r 

o 



n (W\ (71 — W\ 

w=0 nmax j€[0 ,t] (J( 4 ) 



> max 



„ : ^) >t nmax ie[0 , t] (») (7) 



For a binomial term (^) 
large enough, 



fc!(n-fc)! 



and (5 > 0, when n is 



2 n(H(i)-5) < < 2 n(ff( - )+5) . 

Let w = On and t = jn with < 7 < 1, as n becomes 
large enough, we have 

f] a (n,p, q e ) 
= -log 2 B a (n,p,q e ) 



> — log 2 max 



un /n—w\ 
t)\ t ) 



1 2 (ff(e)-,5)r l 
" n l0§2 9:8™ 5?)>7 „ 2 ( ff ^)+ 5 ) e ™2 (H( T^ )+5)(1 - e) " 

> max JT(0) - 6H{1) - (1 - 6)H{-^-) - 26 

1 . 1 

+ -log-. 

n n 

From 6<(1 - 9) > 7, we get 9 > 7 > 0; then is 
a continuous function of 7. As rt becomes large, we have 
P — 3 < 7 < p + S, so we can approximate H(^) with -ff (f )■ 
Similarly, we can approximate H(j^) with iJ(j^). Then 
we can get as n -> 00, 

> max H{6) - 0H&) - (1 - 9)H{-^-A. 

8:6(l-8)>p 1 — 

IfO <p < i,the maximum value can be achieve at 0* = -k. 
Hence we have 

lim i] a (n,p,q e ) > l-H(2p). 

n— >oo 

This completes the proof. ■ 
Theorem 11 (Upper bound). Given < p, q e < 1, we have 

lim v«(n,p,q e )<(l+p)[l-H(-2-)]. 

n->oo ■ L + p 

Proof: For a uniform code correcting t asymmetric errors, 
we have the following observations: 

1) There is at most one codeword with Hamming weight 
at most t; 

2) For t + 1 < w < n, the number of codewords with 

U- t ) 



Hamming weight w is at most 



GO ' 



Consequently, the total number of codewords is 
B a (n,p,q e ) < 1+51 ~pT 

" (n+t\ 

= i + v 



> max 



OOCT) 



< 



2 n+t 



(t) ' 



7 



So as n — > oo, we have 

Va(n,p,q, 



Then 



I 2™ +t 
< -loi[ m l 



(l + 7)"-ff( 



7 



1 + 7 



(l+p)[l-H( 



1+p 



)(l + 7) 

)]. 



where the last step is due to the continuousness of (1 + 7) — 
#(^(1+7) over 7. 

This completes the proof. ■ 

We see that when n — > 00, rj a (n,p,q e ) does not depends 
on q e as long as < q e < 1. It is because that when n — > 00, 
we have t — > pn, which does not depend on q e . This property 
is also hold by r]p(n,p,q e ) when n — > 00. 



B. Bounds of lim n _yoo rjp(n,p, q e ) 

In this subsection, we study the bounds of the asymptotic 
rates of nonuniform codes. Here, we use the same idea as that 
for uniform codes, besides that we need also prove that the 
'edge effect' can be ignored, i.e., the number of codewords 
with Hamming weight w <C n does not dominate the final 
result. 

Theorem 12 (Lower bound). Given < p, q e < 1, we have 

lim rjp{n,p,q e ) > max H (6)-dH(p)-(l-9)H(/—). 

Proof: We consider nonuniform codes that corrects t\_ 
asymmetric errors, where 

t^w) = min{ S | J2 ("V ( X - P) W ~ l ^ 1 - &>> 

for all < w < n. 

Based on Hoeffding's inequality, for any 6 > 0, as w 
becomes large enough, we have (p— S)w < t±(w) < (p + S)w. 
In another word, for any e, 6 > 0, when n is large enough and 
w > en, we have (p — 5)w < ti(w) < (p + 5)w. 

Let w = 9n and ti(w) — 710, then when n is large enough, 
if 9 > e, we have 

(p-S) <7< (p + S). 

If 9 < e, we call it the 'edge' effect. In this case < 7 < 1. 

Since each codeword with Hamming weight w can tolerate 
ti(w) errors, 

Bp(n,p, q e ) > R(n, ti(w), w) > A(n, 2{t] r {w) + 1), w), 

for every w with < w < n. 

Applying the Gilbert Bound, we have 

(n\ 

Bp(n,p,q e ) > max- 



Bp(n,p,q e ) > max- 



C) 



„ /w\ fn—w\ 



w max 



> 



max 



( w \ ( n—w\ 
4» n \t l (w)) \ ti (w)J 



When n —> 00, we have 

rip(n,p, q e ) 
= - log 2 Bp (n, p, q e ) 



> 



> 



2(B(9)-S)n 



— loev, max 

n oz e-.(i-0)>j n2 (ff(7)+<5)9«2 (ff( T^ )+l5)(:L_e) ™ 
max H{9) - 6H(j) - (1 - 0)H(-^-) 

8:(1 — 8)>7 1 — (7 

-2<5 + - log - 
n n 

= max fi(«)-W( 7 )-(l-«)ff(-L). 

8:(1 — 9)>7 1 — 

Note that when 9 < e for small e, we have 

70 



i?(0) - 9H(j) - (1 - 0)if( 



1 - 



0. 



So we can ignore this edge effect. That implies that we can 
write 

p — S < j < p + S, 



for any 9 with < 9 < 1. 

Since 1 — 9 > 7 > 0, for any fixed 9, 



70 



ff(0)-0#( 7 )-(l-0)ff(-^) 
is a continuous function of 7. As rt — >• 00, we have 



P 9 



V p(n,p,q e )> max H(0) - 6H(p) - (1 - 0)fr(-i_). 

0:(1 — 0)>p 1 — (7 

This completes the proof. ■ 



Theorem 13 (Upper bound). Given < p, g e < 1, we /iave 

) — 91 
s(p) 



lim r)p(n,p,q e ) < max _ff ((1 — p)9) — 9H(p) 

n— 'too 0<S<1 



1 



2»(p) -1- 1 ' 2 s (p) + 1 : 



with s(p) = H(p)/(1 - p). 



Proof: Here we use the same notations as above. Similar 
as the proof in Theorem Qj] given (n,p,q e ), the maximal 
number of codewords is 



Bp(n,p,q e ) < 1+ ^2 



( n ) 

\w-ti(w)J 



( w ) 

w=h(o)+i ^hM J 



E 

w=h(0) 



n \ 



(tjw)) 



n Vtu— t\(w)/ 



x 



TABLE I 

Upper bounds and lower bounds for the maximum rates of uniform codes and nonuniform codes 





Lower Bound 


Upper Bound 


lim n _>oo Va{n,p, q e ) 


[l-tf(2p)]J < p <i 


(l+p)[l-H(^)] 


limine r) (n,p,q e ) 


max <9<i- P H(6) - 6H(p) - (1 - e)H(^) 


ma Xo <g< 1 H{(l-p)9)-9H(p) 




icr 4 icr 3 icr 2 io~ 1 io° 

Crossover probability p 



Fig. 4. Bounds of lim,, 



,r] a (n,p,q e ) and lim n _Kx, r]p(n,p, q e ). 



As n — > oo, we have 



r] (n,p,q e ) 
■^og 2 Bfj(n,p, q e ) 



< 



n 
1 



log 2 



oir((l-7)e+5)n 

o<e<i n 2 ( - H( -^ e - s ^ 1 



max H((l - 7 )0) - 0i? (7) 
o<e<i 

max i2Y(l -7)6») -9HM. 
o<e<i 



25 



• logn 



Note that when < e for small e, we have 

#((1-7)0) -OHfr) ~ 0. 

So we can ignore the edge effect. That implies that we can 
write 

p — 6 < j < p + 5, 
for any 9 with < 9 < 1. 



Since for any fixed with < 9 < 1, £T((1 - 7)6*) - 9H(-y) 
is a continuous function of 7. When n — >■ 00, we have 

r}p(n,p,q e ) < m&x H((l - p)9) - 9H(p), 

< C6 < C1 



which equals to 



1 



s{p) 



2«(p) + 1 : 



v 2 s (p) + r 

with s(p) = H(p)/(1 -p). 
This completes the proof. 



C. Comparison of Asymptotic Performances 

Table U summarizes the analytic upper bounds and lower 
bounds of lim^oo r) a (n,p, q e ) and lim^oc T)p (n, p, q e ) ob- 
tained in this section. For the convenience of comparison, we 
plot them in Figure @] The dashed curves represent the lower 
and upper bounds to lim^oo r) a (n,p, q e ), and the solid curves 
represent the lower and upper bounds to lirrin^oo r]p(n,p, q e ). 



9 



The gap between the bounds for the two codes indicate the 
potential improvement in efficiency (code rate) by using the 
nonuniform codes (compared to using uniform codes) when 
the codeword length is large. We see that the upper bound in 
Theorem [T3l is also the capacity of the Z-channel, derived in 
[20|. It means that nonuniform codes may be able to achieve 
the Z-channel capacity as n becomes large, while uniform 
codes cannot (here we assume that they have codewords of 
high weights and worst-case performance is considered, so the 
constructions of uniform codes cannot achieve the capacity of 
Z-channel). 

V. Layered Codes Construction 

In lfl4l . Kl0ve summarized some constructions of uniform 
codes for correcting asymmetric errors. The code of Kim 
and Freiman was the first one constructed for correcting 
multiple asymmetric errors. Varshamov |[T9l and Constrain 
and Rao [5j presented some constructions based group theory. 
Later, Delsarte and Piret [6| proposed a construction based 
on 'expurgating/puncturing' with some improvements given 
by Weber et al. 11231 . It is natural for us to ask whether it 
is possible to construct nonuniform codes based on existing 
constructions of uniform codes. In this section, we propose a 
general construction of nonuniform codes based on multiple 
layers. It shows that the sizes of the codes can be significantly 
increased by equalizing the reliability of all the codewords. 

A. Layered Codes 

Let us start from a simple example: Assume we want to 
construct a nonuniform code with codeword length n = 10 
and 

f for w = 0, 
ti(w) = I 1 for 1 < w < 5, 
[2 for 6 < w < 10. 

In this case, how can we construct a nonuniform code ef- 
ficiently? Intuitively, we can divide all the codewords into 
two layers such that each layer corresponds to an individual 
uniform code, namely, we get a nonuniform code 

C = {xe{0,i}"|w(x)<5,xeCi} 
|J{xe{o,i} n Kx) >6,xeC 2 }, 

where C\ is a uniform code correcting 1 asymmetric error and 
C 2 is a uniform code correcting 2 asymmetric errors. So we 
can obtain a nonuniform code by combining multiple uniform 
codes, each of which corrects a number of asymmetric errors. 
We call nonuniform codes constructed in this way as layered 
codes. However, the simple construction above has a problem 
- due to the interference of neighbor layers, the codewords at 
the bottom of the higher layer may violate our requirement of 
reliability, namely, they cannot correct sufficient asymmetric 
errors. To solve this problem, we can construct a layered code 
in the following way: Let us first construct a uniform code 
correcting 2 asymmetric errors. Then we add more codewords 
into the code such that 

1) The weights of these additional codewords are less than 
4 = 6 — t±(6). This condition can guarantee that in 



12 - 




-2 - 

10 20 30 40 50 

Weight of Codeword 

Fig. 5. A demonstration of function and t;. 

the resulting nonuniform code all the codewords with 
weights at least 6 can tolerate 2 errors. 
2) These additional codewords are selected such that the 
codewords with weights at most 5 can tolerate 1 error. 

B. Construction 

Generally, given a nondecreasing function ij., we can get 
a nonuniform code with t\,(n) layers by iterating the process 
above. Based on this idea, given n, we construct layered 
codes as follows. 

Let k = t±(n) and let Ci,...,Ck be k binary codes of 
codeword length n, where 

Ci D ... D C k , 

and for 1 < t < k, the code Ct can correct t asymmetric 
errors. Given we can construct a layered code C such that 

C = {xe{0,ir|xeC tlWx)) }, 

where 

ti(w(-x.)) = tj.(maxi?„, (x )) 

= t i (max{s|s - ti(s) < w(x)}). 

We see that there is a shift of the layers (corresponding to 
the function ti and the function t^), see Figure [5] as a demon- 
stration. The following theorem shows that the construction 
above satisfies our requirements of nonuniform codes, i.e., it 
corrects asymmetric errors. 

Theorem 14. Let C be a layered code based on the above 
construction, then for all x G C, x can tolerate ij.(u;(x)) 
asymmetric errors. 

Proof: We prove that for all x, y e C with x 7^ y, 
£>(x) p| B(y) = <p. W.l.o.g., we assume w(x) > w(y). 

If w(x) — t^(w(x)) > w(y), the conclusion is true. 

If uj(x) — t^(w(x)) < w(y) and u>(x) > w(y), then 
x, y G Ct((to(y))- That means there does not exist a word 
z e {0,1}" such that x, y > z and iV(x,z) < U(w(y)) 
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and N(y,z) < U(w(y)). Since w(x) - t±(w(x)) < w(y), 
according to the definition of ti, it is easy to get ti(w(y)) > 
ti(w(x)) > ti(w(y)). So there does not exist a word 
z G {0,1}" such that x, y > z and iV(x,z) < t±(w(x.)) 
and N(y, z) < t±(w(y)), namely, B(x) f] B{y) = <f>. 

This completes the proof. ■ 

We see that the constructions of layered codes are based 
on the provided group of codes C\ , ... , such that C\ D 
C2 D ■■• D Cfc and for 1 < t < k, and the code Ct corrects t 
asymmetric errors. Examples of such codes include Varshamov 
codes lfl9l . BCH codes, etc. 

The construction of Varshamov codes can be described as 
follows: Let a\, aa, a n be distinct nonzero elements of 
F q , and let a := (a%, a>2, — , a„). For x = (xt, x%, x n ) G 
{0, 1}™, let xa = (xiai,x 2 ck2, ...,a; n a n ). For .91,52, — ,gt G 
F g and 1 < t < k, let 

C t := {x g {0, l} n |tn(xa) = 5 ; for 1 < I < t}, 

where the elementary symmetric function 07(11) for I > are 
defined by 

r 00 

IJ(jS + tii) = ^2<Ji{u)z r ~ l . 

»=i z=o 

Then Ct can correct i asymmetric errors (for 1 < t < k), and 
Ci D C 2 D ... D C fc . 

Such a group of codes can also be constructed by BCH 
codes: Let (a>o, a±, a„_i) be n distinct nonzero elements 
of G 2m with n = 2 m - 1. For 1 < t < k, let 

n 

C t := {x G {0, 1}"| Sioi M-1) = for 1 < I < t}. 

i=l 

C. Decoding Algorithm 

Assume x is a codeword in C t and y = x + e is a received 
erroneous word with error vector e, then there is an efficient 
algorithm to decode y into a codeword, which is denoted by 
D t (y). If y has at most t asymmetric errors, then D t {y) = x. 
We show that the layered codes proposed above also have an 
efficient decoding algorithm if Dt(-) (for 1 < t < k) are 
provided and efficient. 

Theorem 15. Let C be a layered code based on the above 
construction, and let y = x + e be a received word such 
that x G C and |e| < <j,(w(x)). To recover x from y, 
we enumerate the integers in [ti(w(y)),ti(w(y) +ti(w(y)))]. 
If we can find an integer t such that Dt(y) G C and 
AT(A(y),y) < h(w(D t (y))), then D t (y) = x. 

Proof: If we let t = i^(u>(x)), then we can get that t 
satisfies the conditions and D t (y) — x. So such t exists. 

Now we only need to prove that once there exists t satis- 
fying the conditions in the theorem, we have D t (y) — x. We 
prove this by contradiction. Assume there exists t satisfying 
the conditions but z = D t (y) ^ x. Then N(z,y) < t±(w(z)). 
Since we also have AT(x, y) < t(w(x)), B(x)P)B(z) 7^ <f>, 
which contradicts the property of the layered codes. 

This completes the proof. ■ 



TABLE II 

BCH CODES WITH CODEWORD LENGTH 255 fTOl 



n 


k 


t 


n 


k 


t 


255 


247 


1 


255 


115 


21 


255 


239 


2 


255 


107 


22 


255 


231 


3 


255 


99 


23 


255 


223 


4 


255 


91 


25 


255 


215 


5 


255 


87 


26 


255 


207 


6 


255 


79 


27 


255 


199 


7 


255 


71 


29 


255 


191 


8 


255 


63 


30 


255 


187 


9 


255 


55 


31 


255 


179 


10 


255 


47 


42 


255 


171 


11 


255 


45 


43 


255 


163 


12 


255 


37 


45 


255 


155 


13 


255 


29 


47 


255 


147 


14 


255 


21 


55 


255 


139 


15 


255 


13 


59 


255 


131 


18 


255 


9 


63 


255 


123 


19 









In the above method, to decode an erroneous word y, we 
can check all the integers between ti(w(y)) and ti(w(y) + 
ti(w(y))) to find the value of t. Once we find the integer 
t satisfying the conditions in the theorem, we can decode y 
into Dt(y) directly. (Note that the length of the interval for 
t, namely ti(w(y) + ti(w(y))) — ti(w(y)), is normally much 
smaller than w(y). It is approximately (jz^p w (y) f° r i-i-d. 
errors when w(y) is large.) We see that this decoding process 
is efficient if D t (.) is efficient for 1 < t < k. 

D. Layered vs.Uniform 

Typically, nonlinear codes, like Varshamov codes are su- 
perior to BCH codes. But it is still not well-known how to 
estimate the sizes of Varshamov codes and their weight dis- 
tributions. To compare uniform constructions and nonuniform 
constructions for correcting asymmetric errors, we focus on 
BCH codes, namely, we compare normal BCH codes with 
layered BCH codes. Here, we consider i.i.d. errors, and we 
assume that the codeword length is n = 255, the crossover 
probability is p and the maximal tolerated error probability is 
q e - 

Table [II] shows the relations between the dimension k 
and the number of errors t that can be corrected in BCH 
codes when n = 255. According to |fT31l , many BCH codes 
have approximated binomial weight distribution. So given an 
(255, fe, t) BCH code, the number of codewords of weight i 
is approximately 




For a normal BCH code, it has to correct t errors with 

* = min{s G N\J2 ("V (! - pT^ > 1 - Qe}, 

then it has 2 k codewords where k can be obtained from table 
[D] based on the value of t. 
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Fig. 6. The estimated rates of BCH codes and layered BCH codes when 
n = 255, g e = Id -4 . 



For a layered BCH code, the codewords with Hamming 
weight w have to correct ti(w) asymmetric errors such that 



t^w) = mm{s EN\^2 ( ™ W(l - p)^ > 1 - q e } 



for all < w < n. Based on the approximated weight 
distribution of BCH codes, the number of codewords in a 
layered BCH codes can be estimated by summing up the 
numbers of codewords with different weights. 

Figure |6]plots the estimated rates of BCH codes and layered 
BCH codes for different p when n = 255 and q e = 10~ 4 . 
Here, for a code C, let #C be the number of codewords, then 
the rate of C is defined as log 2 (#c) p rom figure, we see 
that under the same parameters (n,p, q e ), the rates of layered 
BCH codes are much higher than those of BCH codes. By 
constructing nonuniform codes instead of uniform codes, the 
code rate can be significantly increased. Comparing Figure [6] 
with Figure [3] it can be seen that the rates of layered BCH 
codes are very close to the upper bounds of uniform codes. 
It implies that we can gain more by considering nonuniform 
codes rather than nonlinear uniform codes. 



VI. Flipping Codes Construction 

Many nonlinear codes designed to correct asymmetric errors 
like Varshamov codes are superior to linear codes. However, 
they do not yet have efficient encoding algorithms, namely, it is 
not easy to find an efficient encoding function / : {0, l} fc — > C 
with k [log |C|J. In this section, we focus on the approach 
of designing nonuniform codes for asymmetric errors with 
efficient encoding schemes, by utilizing the well-studied linear 
codes. 

A simple method is that we can use a linear code to correct 
t\(n) asymmetric errors directly, but this method is inefficient 
not only because the decoding sphere for symmetric errors is 
greater than the sphere for asymmetric errors (and therefore 
an overkill), but also because for low-weight codewords, the 



number of asymmetric errors they need to correct can be much 
smaller than t±(n). 

Our idea is to build a flipping code that uses only low-weight 
codewords (specifically, codewords of Hamming weight no 
more than ~ because they need to correct fewer asym- 
metric errors and therefore can increase the code's rate. In the 
rest of this section, we present two different constructions. 

A. First Construction 

First, we construct a linear code C (like BCH codes) 
of length n with generator matrix G that corrects i^(L§J) 
symmetric errors. Assume the dimension of the code is k. 
For any binary message u E {0,1} , we can map it to a 
codeword x in C such that x = uG. Next, let x denote a word 
obtained by flipping all the bits in x such that if x. L = then 
Xi = 1 and if a;, = 1 then Xi = 0; and let y denote the final 
codeword corresponding to u. We check whether w(x) < [_§ J 
and construct y in the following way: 

X00...0 if iu(x) < LfJ, 
xll...l otherwise. 

Here, the auxiliary bits (0s or Is) are added to distinguish that 
whether x has been flipped or not, and they form a repetition 
code to tolerate errors. 

The corresponding decoding process is straightforward: 
Assume we received a word y'. If there is at least one 1 in the 
auxiliary bits, then we "flip" the word by changing all 0s to 
Is and all Is to 0s; otherwise, we keep the word unchanged. 
Then we apply the decoding scheme of the code C to the first 
n bits of the word. Finally, the message u can be successfully 
decoded if y' has at most J) errors in the first n bits. 

B. Second Construction 

In the previous construction, several auxiliary bits are 
needed to protect one bit of information, which is not very 
efficient. Here we try to move this bit into the information 
part of the codewords in C. This motivates us to give the 
following construction. 

Let C be a systematic linear code with length n that corrects 
t' symmetric errors (we will specify t' later). Assume the 
dimension of the code is k. Now, for any binary message 
u e {0, l} fe_1 of length k — 1, we get u' = Ou by adding one 
bit in front of u. Then we can map u' to a codeword x in 
C such that 

x = (0u)G = Ouv, 

where G is the generator matrix of C in systematic form and 
the length of v is n — k. Let a be a codeword in C such that 
the first bit oti = 1 and its weight is the maximal one among 
all the codeword in C, i.e., 

a = are max u>(x). 

xec,n=i 

Generally, w(a) is very close to n. For example, in any primite 
BCH code of length 255, a is the all-one vector; also we can 
construct LDPC codes that include the all-one vector as long as 
their parity-check matrices have even number of ones in each 
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column. In order to reduce the weights of the codewords, we 
use the following operations: Calculate the relative weight 

w(x.\a) — \{l < i < n\xi = l,aa = 1}|. 

Then we get the final codeword 

_ f x + a if w(x\a) > 
\ x otherwise, 

where + is the binary sum, so x + a is to flip the bits in x 
corresponding the ones in a. So far, we see that the maximal 
weight for y is [n — ^ j ■ That means we need to select t' 
such that 



For many linear codes, a is the all-one vector, so t 1 = ti(\j£ J). 

In the above encoding process, for different binary mes- 
sages, they have different codewords. And for any codeword 
y, we have y G C. That is because either y = x or y = x+a, 
where both x and a are codewords in C and C is a linear code. 
So the resulting flipping code is a subset of code C. 

The decoding process is very simple: Given the received 
word y' = y + e, we can always get y by applying the 
decoding scheme of the linear code C if |e| < t'. If t/i = 1, 
that means x has been flipped based on a, so we have 
x = y + a; otherwise, x = y. Then the initial message 
u = x 2 x 3 ...x k . 

We see that the second construction is a little more efficient 
than the first one, by moving the bit that indicates flips from 
the outside of a codeword (of an error-correcting code) to the 
inside. Here is an example of the second construction: Let C 
be the (7, 4) Hamming code, which is able to correct single- 
bit errors. The generating matrix of the (7, 4) Hamming code 

/ 1 1 1 \ 

10 10 1 

1 1 1 

V 1 1 1 1 / 

Here we have t' = 1 and k = 4. Assume the binary message 
is u = 011, then we have x = (0u)G = 0011100. It is easy to 
see that a is the all-one codeword, i.e., a — 1111111. In this 
case, u>(x|a) <= so the final codeword y = 0011100. 

Assume the binary message is u = 110, then we have x = 
(0u)G = 0110110. In this case, w(x\a) > ^2, so the final 
codeword y = x + a = 1001001. 

Assume the received word is y' = 0001001. By applying 
the decoding algorithm of Hamming codes, we get y = 
1001001. Since y\ = 1, we have x = y + a, and as a result, 
u = 110. 

C. Flipping vs. Layered 

When n is sufficiently large, the flipping codes above 
become nearly as efficient (in terms of code rate) as a 
linear codes correcting t|,(|_§ J) symmetric errors. It is much 
more efficient than designing a linear code correcting t±(n) 
symmetric errors. Note that when n is large and p is small, 
these codes can have very good performance on code rate. 
That is because when n is sufficiently large, the rate of an 
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Fig. 7. The estimated rates of flipping/layered BCH codes when n = 
255,g e = 10" 4 . 

optimal nonuniform code is dominated by the codewords with 
the same Hamming weight wg (< £)> and Wd approaches ^ 
as p gets close to 0. We can intuitively understand it based on 
two facts when n is sufficiently large: (1) There are at most 
n 2 n ( H ( — )+ s ) codewords in this optimal nonuniform code. (2) 
When p becomes small, we can get a nonuniform code with at 
least 2™( 1 ~' 5 ) codewords. So when n is sufficiently large and 
p is small, we have Wd — > §■ Hence, an optimal nonuniform 
code has almost the same asymptotic performance with an 
optimal weight-bounded code (Hamming weight is at most 
n/2) that corrects ij.(n/2) asymmetric errors. 

Let us consider a flipping BCH code based on the second 
construction. Similar as the previous section, we assume that 
the codeword length is n — 255 and the number of codewords 
with weight i can be approximated by 

(n\ 

2" ' 

where k is the dimension of the code. Figure [7] compares the 
estimated rates of flipping BCH codes and those of layered 
BCH codes when n = 255 and q e = 10 -4 . Surprisingly, 
the flipping BCH codes achieves almost the same rates as 
layered BCH codes. Note that, for the layered codes, we are 
able to further improve the efficiency (rates) by replacing BCH 
codes with Varshamov codes, i.e., based on layered Varshamov 
codes. 

VII. Extension to Binary Asymmetric Channels 

In the previous sections, we have introduced and studied 
nonuniform codes for Z-channels. The concept of nonuniform 
codes can be extended from Z-channels to general binary 
asymmetric channels, where the error probability from to 
1 is smaller than the error probability from 1 to but it 
may not be ignorable. In this case, we are able to construct 
nonuniform codes correcting a big number of 1 — > errors 
and a small number of — >• 1 errors. Such codes can be 
used in flash memories or phase change memories, where the 
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Fig. 8. A demonstration of x,y,x',y'. 



change in data has an asymmetric property. For example, the 
stored data in flash memories is represented by the voltage 
levels of transistors, which drift in one direction because of 
charge leakage. In phase change memories, another class of 
nonvolatile memories, the stored data is determined by the 
electrical resistance of the cells, which also drifts due to 
thermally activated crystallization of the amorphous material. 
This asymmetric property will introduce more 1 — > errors 
than — !> 1 errors after a long duration. 

In this section, we first investigate binary asymmetric chan- 
nels where the probability from to 1 is much smaller than 
that from 1 to 0, namely, pf <C p\, but pf is not ignorable. 
In this case, we can let tf be a constant function. Later, we 
consider general binary asymmetric channels, where tf can be 
an arbitrary nonincreasing step function. 



A. tf Is a Constant Function 

We show that if tf is a constant function, then correcting 
[tj.,^] errors is equivalent to correcting t± + tf asymmet- 
ric errors, where t± can be an arbitrary step functions on 
{0,l,...,n}. 

Theorem 16. Let tf be a constant function, a code C is a 
nonuniform code correcting \t\.,t\\ errors if and only if it is 
a nonuniform code correcting t± + tf asymmetric errors. 

Proof: 1) We first show that if C is a nonuniform code 
correcting [i^tf] errors where tf is a constant function, then 
it can correct tf + tf asymmetric errors. We need to prove that 
there does not exists a pair of codewords x, y 6 C such that 

N(x,y) <t|Hx))+tt, 



where 



iV(y,x) <t±(w(y))+tf, 



^(x,y)4|{i:xi = l >W = 0}|. 



Let us prove it by contradiction. Assume that their exists a 
pair of codewords x, y that satisfy the inequalities above. By 
adding at most — > 1 errors, we get a vector x' from x such 
that the Hamming distance between x' and y is minimized; 
also we get a vector y' from y such that the Hamming distance 



between y' and x is minimized. In this case, we only need to 
show that 

AT(x',y') < ^Hx)),JV(y',x') < t±(w(y)), 

which contradicts with our assumption that C can correct 
[ij,,^] errors. The intuitive way of understanding x',y' is 
shown in Figure [S] In the figure, we present each vector as a 
line, in which the solid part is for Is and the dashed part is 
for 0s. 

If iV(x',x) < tf and N(y',y) < tf, then 

x\ = max(a; i ,y l ) = y[, 

so x' = y'. The statement is true. 

If JV(x',x) < f t and iV(y',y) = if, then y' < x'. In this 
case, 

A/(x',y')<A r (x,y)-i t <t i ( W (x)). 

We get the statement. 

Similarly, if N(y',y) < tf and N(x.',x) — tf, we have 
x' < y' and 

Ar(y',x')<iV(y,x)-t t <^( W (y)). 

If iV(x',x) = tf and N(y',y) = tf, we can get 

iV(x',y')<iV(x,y)-i t <t i ( W (x)), 

iV(y',x')<iV(y,x)-t t <^Ky)). 

Based on the discussions above, we can conclude that if C 
is a nonuniform code correcting [i|,,£-|-] errors where tf is a 
constant function, then it is also a nonuniform code correcting 
t\ + tf asymmetric errors. 

2) We show that if C is a nonuniform codes correcting 
t\ + tf asymmetric errors where tf is a constant function, 
then it is also a nonuniform code correcting [t^if] errors. 
That means for any x, y 6 C, there does not exist a vector v 
such that 

A/(v,x)<t t , JV(x,v) <^0(x)), 

N(v,y)<tf, N(y,v) <U(w(y)). 

Let us prove this by contradiction. We assume there exists 
a vector v satisfies the above conditions. Now, we define a 
few vectors x' , y' , u such that 

x i = min(a;,, v{) VI < i < n, 

y'i = min(yi, Vi) VI < i < n, 

m = mm(xi,yi> v t ) VI < i < n. 

The intuitive way of understanding these vectors is shown in 
Figure [9] In the figure, we present each vector as a line, in 
which the solid part is for Is and the dashed part is for 0s. 
Then 

x' < x, x' < v, iV(x, x') < tj.(tu(x)), AT(v, x') < tf, 
y' < y,y' < v, N{y,y') < t±(w(y)),N(v,y') < tf. 
Now we want to show that 

iV(x,u) <tf(w(x))+tf. 
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Since 

AT( X , u) < AT( X , x') + N(x', u), 
we only to show that 

iV(x',u) < t t . 

According to the definition of u, it is easy to get that 

Ar(v,x') + jV(x',u) = iV(v,y') + iV(y» 
< iV(v,x')+iV(v,y') 

So N(x' , u) < t^, which leads us to 

iV(x,u) <^(w(x))+i t . 

Similarly, we can also get 

N(y,u) <f|(ffi(y))+« t . 

In this case, C is not a nonuniform codes correcting ij_ + 
asymmetric errors, which contradicts with our assumption. 

Based on the discussions above, we can get the conclusion 
in the theorem. ■ 

According to the above theorem, all our results for Z- 
channels, like upper bounds and constructions of nonuniform 
codes, can apply to nonuniform codes correcting [t|_, t^] errors 
if t-f is a constant function. 

B. if Is a Nonincreasing Function 

Another case of binary asymmetric channel is that p^ < p^ 
but p^ is not much smaller than p±. In this case, it is not 
efficient to write as a constant function. Instead, we consider 
it as a nonincreasing step function. 

Theorem 17. Let be a nondecreasing function and be 
a nonincreasing function. A code C is a nonuniform code 
correcting [£,[., if] errors if it is a nonuniform code correcting 
+ t\ asymmetric errors. Here, for all < w < n, 

t^(w) — tf (max{s|i-f (s) + s < w — ti(w)}). 

Proof: Let C be a nonuniform code correcting t± + t^- 
errors. For any x, y 6 C, w.l.o.g, we assume w(x) < w(y). 
If u>(x) + t|(ui(x)) < w(y) — ti(w(y)), then there does not 
exist a vector v such that 

AT(v,x)<t t , AT(x,v) <^(w(x)), 



JV(v,y)<it, iV(y,v) <t±(w{y)). 

If w(x) + t^(w(x)) > w(y) — ti(w(y)), according to the 
proof in Theorem [16] we can get that there does not exist a 
vector v such that 

2V(v,x) <t t (u>(x)), 
N(x, v) < *4,(w(x)) + (ib(x)) - «t(w(x)); 
AT(v,y) <t t Hx)), 

JV(y, v) < *;(w(y)) + tf (w(y)) - *tMx)). 

Since 

t^Wx))-( t Wx))>0, 
i t Hx)) > tt(tt(y)), 

tf(w(y)) > ttM*))> 

we can get that there does not exist a vector v such that 

Ar(v,x)<i t , JV(x,v) <t 4 (w(x)), 
JV(v,y)<*t. %,v)<i|(w(y)), 

Finally, we conclude that C is a nonuniform code correcting 
[t±,tf] en-ors. ■ 

According to the above theorem, we can convert the prob- 
lem of constructing a nonuniform codes for an arbitrary binary 
asymmetric channel to the problem of constructing a nonuni- 
form correcting only 1 — > errors. Note that this conversion 
results in a little loss of code efficiency, but typically it is 
very small. Both layered codes and flipping codes can be 
applied for correcting errors in binary asymmetric channels. 
A little point to notice is that t± + tf might not be a strict 
nondecreasing function of codeword weight. In this case, we 
can find a nondecreasing function th which is slightly larger 
than ti + tf, and construct a nonuniform code correcting th 
asymmetric errors. 

When we apply flipping codes for correcting errors in 
binary asymmetric channels, we do not have to specify t\. 
and t-f separately. For example, assume that i.i.d. errors are 
considered. If the maximal tolerated error probability is q e , 
then given a codeword of weight w, it has to tolerate total 
tf(w) errors. For < w < n, tf(w) can be obtained by 
calculating the minimal integer t such that 
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To construct a flipping code, we only need to find a linear code 
such that it corrects tf(\n — §J) symmetric errors, where a 
is the codeword with the maximum weight in the linear code. 

Theorem 18. Let t± be a nondecreasing function and f-j- be 
a nonincreasing function. If a code C is a nonuniform code 
correcting [t^ , i-j-] errors, then it corrects t± + t^ asymmetric 
errors. Here, 

t^{w) — t-j-(min{s|s — i-f-(s) — £j,(s) < w}). 
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Proof: The proof of this theorem is very similar as the 
proof for the previous theorem. It follows the conclusion in 
Theorem [16] ■ 

According to the theorem above, to calculate the upper 
bound of nonuniform codes correcting [t±, tf] errors, we can 
first calculate the upper bound of nonuniform codes correcting 
ti + tf asymmetric errors. Generally speaking, nonuniform 
codes correcting [tj.,tf] errors (considering the optimal case) 
are more efficient than nonuniform codes correcting t\, + tf 
asymmetric errors, but less efficient than those correcting 
tf + tf asymmetric errors. According to the definitions of tf 
and tf(w), it is easy to get that 

tf(w) < tf(w) < tf(w), 

for < w < n. Typically, if pi,pf <C 1, then tf(w) — 
tf(w) <C tf(w). It implies that nonuniform codes correcting 
[tj., tf] errors are roughly as efficient as those correcting 
ti + tf asymmetric errors. If we consider i.i.d. errors and long 
codewords, it is equally difficult to correct errors introduced 
by a binary asymmetric channel with crossover probabilities 
Pl and pf or a Z-channel with a crossover probability p^ +pf. 

VIII. Concluding Remarks 

In storage systems with asymmetric errors, it is desirable to 
design error-correcting codes such that the reliability of each 
codeword is guaranteed in the worst case, and the size of the 
code is maximized. This motivated us to propose the concept 
of nonuniform codes, whose codewords can tolerate a number 
of asymmetric errors that depends on their Hamming weights. 
We derived an almost explicit upper bound on the size of 
nonuniform codes and compared the asymptotic performances 
of nonuniform codes and uniform codes - it is evident that 
there is a potential performance gain by using nonuniform 
codes. In addition, we presented two general constructions of 
nonuniform codes, including layered codes and flipping codes. 
Open problems include efficient encoding for layered codes 
and the construction of flipping codes when p is not small. In 
general, the construction of simple and efficient nonuniform 
codes is still an open problem. 
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